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ABSTRACT 

The Atacama Cosmology Telescope (ACT) is currently observing the cosmic microwave background with ar- 
cminute resolution at 148 GHz, 218 GHz, and 277 GHz. In this paper, we present ACT’s first results. Data have 
been analyzed using a maximum-likelihood map-making method which uses B-splines to model and remove the 
atmospheric signal. It has been used to make high-precision beam maps from which we determine the experiment’s 
window functions. This beam information directly impacts all subsequent analyses of the data. We also used the 
method to map a sample of galaxy clusters via the Simyaev-Zel’dovieh (SZ) effect, and show five clusters previously 
detected with X-ray or SZ observations. We provide integrated Compton-v measurements for each cluster. Of par- 
ticular interest is our detection of the z = 0.44 component of A3 128 and our current non-detection of the low-redshift 
part, providing strong evidence that the further cluster is more massive as suggested by X-ray measurements. This 
is a compelling example of the redshift-independent mass selection of the SZ effect. 

Subject headings: cosmic background radiation - cosmology: observations - galaxies: clusters: general - methods: 
data analysis 



2 


HINCKS ET AL. 


1. INTRODUCTION 

A new generation of experiments is measuring the cosmic mi- 
crowave background (CMB) at arcminute resolutions. Within 
the past year alone, results from the South Pole Telescope 
(Staniszewski et al. 2009), ACBAR (Reichardt et al. 2009a), 
AMiBA (Umetsu et al. 2009), APEX-SZ (Reichardt et al . 
2009b), the Cosmic Background Imager (Sievers et al. 2009), 
the Sunyaev-Zel’dovich Array (Sharp et al. 2010), and QUaD 
(Friedman et al. 2009) have revealed the ~arcminute structure 
of the CMB with higher precision than ever. The angular power 
spectrum of temperature fluctuations at these scales (£ > 1 000) 
will further constrain models of the early universe. Furthermore, 
secondary features such as the Sunyaev-Zel’dovich (SZ) effect 
and gravitational lensing probe the growth of structure. 

With its first science release, the Atacama Cosmology Tele- 
scope (ACT) now adds to these endeavors. A 6 m, off-axis Gre- 
gorian telescope, it was commissioned on Cerro Toco in north- 
ern Chile in 2007 October. Its current receiver is the Millimeter 
Bolometer Array Camera (MBAC), containing three 32x32 ar- 
rays of transition edge sensor (TES) bolometers observing at 
central frequencies of 148 GHz, 21 8 GHz, and 277 GHz, with 
beam full-widths at half-maxima (FWHM) of 1.'37, l.'Ol, and 
0 .' 9 1 , respectively (see Section 3.2, below). It has operated for 
three seasons and is currently in its fourth season. In 2007 
one month of science observations was made using only the 
148 GHz array. The other two frequencies were added for the 
2008 season, which lasted about 3.5 months. The telescope op- 
tical design is described in Fowler etal. (2007). Hincksetal. 
(2008) and Switzer et al. (2008) report on the telescope perfor- 
mance and provide an overview of hardware and software sys- 
tems. The MBAC design and details of TES detector proper- 
ties and readout are in Niemack (2006), Marriage et al. (2006), 
Battistelli et al. (2008), Niemack etal. (2008), Swetz et al. 
(2008), Thornton et al. (2008), and Zhao et al. (2008). 

ACT is located at one of the premier sites for millimeter as- 
tronomy because of the high altitude (5200 m) and the dry at- 
mosphere. The precipitable water vapor (PWV) had a median 
value of 0.56 mm during the nights of our 2008 season. Never- 
theless, atmospheric emission remains the largest signal external 
to the receiver in our raw data, a reality for any ground-based 
millimeter-wave telescope. The atmospheric power dominates 
only at low temporal frequencies and this is the main reason 
we observe while scanning our telescope in azimuth. Though 
much of the atmospheric power is below the frequency of our 
0.0978 Hz scans, on typical nights the atmosphere dominates the 
detector noise up to about 2 Hz. 

In this paper we present a map-making method designed to 
model and remove the atmospheric signal in a manner which 
is unbiased with respect to the celestial signal. The method — 
which is independent of the map-making pipeline used for most 
other ACT analysis — currently produces its best results on small 
scales (< 1°), so it is well-suited to making maps of objects 
with small angular sizes. One of the most useful applications 
has been the study of our instrumental beam with high signal- 
to-noise maps of planets. The beam profile affects all aspects 
of data analysis, including calibration, and we provide the beam 
characteristics in this paper. Additionally, we present new SZ 
measurements of five known clusters. 

We proceed as follows: In Section 2 we introduce the map- 
making method, showing both the theory and some qualitative 
properties; Section 3 describes how we analyzed our beams, and 
presents the key measured parameters along with beam maps 

Reprints of all the references in this paragraph may freely be downloaded 
from : h ttp://www.phys ics .princeton. edu/act/papers .htm 1. 


and radial profiles; window functions are derived in Section 4; 
Section 5 shows a selection of clusters imaged with the mapper; 
and we conclude in Section 6. 

2. THE C.OTTINGHAM MAPPING METHOD 

In this section, we present a technique for removing the atmo- 
spheric power first described by Cottingham (1987) and used 
by Meyer et al. (1991), Boughn et al. (1992), and Ganga et al. 
(1993). The temporal variations in atmospheric signals are mod- 
eled using B-splines, a class of functions ideal for interpola- 
tion, discussed more below. The technique computes maximum- 
likelihood estimates of both the celestial and the atmospheric 
signals, using all available detectors in a single frequency band. 
We refer to it hereafter as the Cottingham method. 

In the following subsections, we give a mathematical descrip- 
tion of the Cottingham method (Section 2.1), followed by a 
discussion of its benefits and a comparison to the “destriping” 
method developed for Planck, which has close similarities (Sec- 
tion 2.2). Our approach for including the effects of spatial vari- 
ability across the detector arrays is in Section 2.3. We discuss 
the use of B-splines in Section 2.4, and finish by outlining our 
implementation of the method (Section 2.5) and map-making 
steps (Section 2.6). 

2.1. The A Igorithm 

The measured timestream d is modeled as a celestial signal 
plus an atmospheric component: 

d=Pm + Ba + n, (1) 

where the pointing matrix P projects the celestial map m into 
the timestream, B is a matrix of basis functions with amplitudes 
a which model the temporal variation of atmospheric power, 
and n is the noise. The timestream of measurements d may be 
a concatenation of multiple detectors if they have been properly 
treated for relative gain differences. Throughout this paper, this 
is the case: all working detectors from one frequency band are 
processed simultaneously. 

We seek a and m. estimates of the atmospheric amplitudes 
and the celestial map, respectively. Equation ( 1 ) prescribes that 
we subtract the atmospheric term to obtain the map estimate: 
d' = d-Bot. The maximum-likelihood estimator is then given 
by the standard map-making equation (e.g., Tegmark 1997): 

;«= (P 7 'N“ , P) _I P 7 'N^r/ , = n( £ /-Ba), (2) 

where we call 

n= (P r N“‘P)“' P r N“' (3) 

the projection matrix and N = ( nn T ) is the noise covariance. 
The projection matrix II is designed in such a way that the map 
estimate is not biased, in the sense that the error, m—m, docs not 
depend on m. 

Given a set of basis functions B, the Cottingham method min- 
imizes the variance of the map pixel residuals with respect to the 
amplitudes a. The residuals are the differences between the ce- 
lestial signals measured in the timestream and the map estimate: 

Ad = d' - Pin = </-Bq- PIT (rf-BS) 

= (I-PII)(y-BS), (4) 

where 1 is the identity matrix. We differentiate y 2 : 
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= ~ (M T W'Ad) 
act dot 

= -2B r (1 - pnf N-' ( 1 - pn) (d- Ba) 

= -2B 7 N“‘ (1 - PIT) (d- Ba) . (5) 

The last equality can be obtained by expanding II to its con- 
stituent elements (cf. Equation (3)) and simplifying. If we de- 
fine the following: 

S = B r N“‘(l-Pn), 0 = SB. <p=~d, (6) 

then when we set the derivative in Equation (5) to zero, we have 
the simple expression: 


05 = 4>. (7) 

This is a linear equation which is straightforward to solve for the 
atmospheric basis function amplitude estimates 5. These can 
then be used in Equation (2) to estimate the map. In fact, both 5 
and m are the maximum-likelihood estimators of the atmosphere 
and celestial map, respectively, for a given set of basis functions 
B. We show this explicitly in Appendix A. 

There is an arbitrary overall offset to the computed B5 which 
must be estimated to remove the background from maps. We 
return to this point in Section 2.6. 

2.2. Discussion 

The chief strength of the Cottingham method is that it esti- 
mates the atmospheric power in a way that is unbiased with re- 
spect to the map estimate itself. This important but subtle point 
is encapsulated in the term (1 - PEI) in Equation (5), whose ef- 
fect is to project out the map estimate from the data. Therefore, 
the solution to Equation (7) is not sensitive to the estimated ce- 
lestial temperature, but only to a time-varying term which is rep- 
resented by the atmospheric estimate B5. This should be con- 
trasted with high-pass filtering or fitting a slowly-varying func- 
tion to the timestream to remove low-frequency power. Such 
approaches require masking of high signal-to-noise celestial ob- 
jects (such as planets or clusters) and/or multiple iterations to 
prevent corruption of the maps. Simulations are required to 
understand the effects of these time-domain filters on the final 
maps. 

The Cottingham method has close similarities to the “de- 
striping” technique developed in particular for Planck analysis 
(Delabrouille 1998; Burigana et al. 1999; Maino et al. 2002). In 
fact, the linear algebra presented in Section 2,1 is identical to 
some versions of destriping (e.g., Keihanen et al. 2004). The de- 
striping techniques are intended primarily to remove 1 / / instru- 
mental noise — thus, for example, Keihanen et al. (2005) impose 
a prior on the estimate Ba based on detector noise. Sutton et al. 
(2009) also consider the effects of imposing a prior on the atmo- 
spheric noise. On the other hand, we use the Cottingham method 
to remove atmospheric power with a flat prior. A distinct feature 
of our method is that we process multiple detectors simultane- 
ously since the atmospheric signal is common across detectors 
(see, however, Section 2.3). Further, our approach differs in that 
it uses B-splines as the basis for modeling the atmosphere (Sec- 
tion 2.4). 


In fact, we know that there is also spatial structure in the atmo- 
sphere, meaning that in principle, each detector might see a dif- 
ferent atmospheric signal. In practice, the finite telescope beam 
sets a lower limit on the spatial scale. We find that the atmo- 
spheric signal is coherent across a quarter to a third of the array, 
or about 5-7 ' . For reference, our 148 GHz channel, which has a 
1 137 FWHM in the far-field (Section 3.2), is sensitive to an an- 
gular size of approximately 10' at a 1 km distance, roughly the 
distance to a typical turbulence layer in the atmosphere when 
pointed at 50° in altitude (Perez-Beaupuits et al. 2005). 

To account for this, we divide the 32 x 32 detector array into 
nine square sub-arrays of roughly equal size and fit for nine 
separate temporal atmospheric signals B,a s , with the subscript 
s denoting the sub-array. These can all be done simultaneously 
if we adapt Equation ( 1 ): 
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= Pw + SB'a , + «, (8) 

where S is a book-keeping matrix that remembers from which 
sub-array each measurement in d came. The Cottingham 
method proceeds exactly as before, except that we change B -*• 
SB' and a — »■ a'. 

2.4. The B-spline as a Model of Atmospheric Signal 

We follow Cottingham (1987) in choosing cubic basis B- 
splines for the basis functions B. B-splines are widely used 
in the field of geometrical modeling, and numerous textbooks 
cover them (e.g., Bojanov et al. 1993; de Boor 2001; Schumaker 
2007); here we summarize basic properties. Basis B-splines 
are a basis of functions whose linear combination is called a B- 
spline. The basis B-splines are fully determined by a knot spac- 
ing 7> and a polynomial order p; a B-spline is flexible on scales 
larger than 7 >, while on smaller scales it is relatively rigid. The 
basis B-splines bj p (t) of order p are readily evaluated using the 
Cox de Boor recursion on the polynomial order. For m knots 
{tj} with j = 0 to m- 1: 


«={: 


bj,p(t)~- 


t-t, 


°j-P~ i 


( 0 - 


t j+p+\ t 
t j+p+i *y+ 1 


bj+\.p-\(t), (9) 


with j values restricted so that j+p+l <m-l. For m knot 
times, m+p- 1 basis B-splines cover the interval between the 
first and the last knot time. The individual basis B-splines bj. p (t) 
are compact functions, such that the B-spline receives support 
from no more than p of its bases at any point. For modeling the 
atmospheric signal, we always choose knots uniformly spaced 
in time and use p = 3 (cubic). 

Due to their flexibility on large scales, B-splines are ideal for 
modeling the slowly varying atmospheric signal. The frequency 
fk below which power will be removed is determined by the knot 
spacing t>. Empirically, we find: 


/*« 1/271. 


( 10 ) 


2.3. Spatial Structure in the Atmosphere 

The Cottingham method as presented thus far assumes that 
the atmospheric signal Ba is common among all the detectors. 


Figure 1 shows an example of atmospheric estimation us- 
ing the Cottingham method. The B-spline knot-spacing is = 
0.25 s, chosen for this example because it has fk = 2 Hz, the ap- 
proximate frequency at which the atmospheric power meets the 
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Frequency (Hz) 

Fig. 1 . — Example of the Cottingham method. The fit is done using 300 s 
of data from 605 148 GHz detectors. The PWV was 0.8 mm. about 0.25 mm 
higher than the median in 2008. The knot spacing is 0.25 s and the order is 
cubic. In both plots, the original signal is plotted with a solid, light line, the 
B-spline atmosphere model with a dashed line and the signal minus the model 
with a solid, dark line. Each plot has been smoothed with a five sample boxcar 
filter for readability. The temperature units are with respect to a Rayleigh-.feans 
spectrum. Top: a portion of one of the detectors’ timestreams. Bottom: the 
spectral densities for the same single detector. A Welch window was applied 
before computing the Fourier transform. 

detector noise level. Longer knot spacings produce qualitatively 
similar results, except that they cut off at lower frequencies, as 
per Equation (10). 

The Cottingham method is effective at suppressing atmo- 
spheric contamination, but some covariance between atmo- 
spheric and celestial map estimates remains. This is typically 
at harmonics of the scan frequency (« 0.1 Hz), as exemplified 
in the bottom panel of Figure 1 . For this work we have used 
the white noise approximation for the detectors (N = 1), which 
results in maps of bright sources which are clean down to the 
-40 dB level in most cases (Section 3). The small residual 
atmospheric-celestial covariance is manifested as striping along 
lines of constant altitude since with our 7° peak-to-peak, 1 ?5 s ' 
azimuthal scans (or 4.°47 at 0.°958 s“ ! when projected on the sky 
at our observing altitude of 50.°3), the knot spacing (n = 1 ,0s 
for beam maps (Section 3) and 0.5 s for cluster maps (Section 5)) 
corresponds to an angular scale smaller than the scan width. 
When noted, we fit straight lines to rows of pixels in the map, 
after masking out any bright source, and subtract them. We call 
this process, which takes place in the map domain, “stripe re- 
moval”. In both our beam analysis and our cluster studies, we 
have done tests which show that the bias introduced by this pro- 
cess is not significant — see Section 3.2 and Section 5.3. Nev- 
ertheless, future extensions of the Cottingham method would 
benefit from the full treatment of the noise covariance. 

2.5. Implementation 

Before making maps with the Cottingham method, some pre- 
processing must be done. The data from all detectors, which are 


sampled at 400 Hz, arc divided into fifteen-minute time-ordered 
data (TOD) files and the preprocessing is performed on each in- 
dividual TOD — a future paper will describe the steps which we 
only summarize here. The data acquisition electronics’ digital 
anti-aliasing filter as well as measured detector time constants 
are deconvolved from the raw data. Low frequency signal due 
to cryogenic temperature drifts is measured with dark detectors 
(i.e., detectors uncoupled to sky signal) and removed from sig- 
nal detectors; a sine wave with period 10.23 s is also fit to each 
timestream and removed to reduce scan-synchronous contami- 
nation. Calibration to units of power uses nightly load curves 
obtained by sweeping through detector bias voltages and mea- 
suring the response. Relative gain imprecisions are removed 
by using the large atmospheric signal itself to flat field the de- 
tectors (e.g., Kuo et al. 2004); this is done independently in 
each of the nine sub-arrays (see Section 2.3). Finally, cali- 
bration to temperature units uses measurements of Uranus, for 
which we use a brightness temperature of 1 12 K with 6% uncer- 
tainty (Griffin & Orton 1993; Marten et al. 2005; Kramer et al. 
2008). (The beam maps require no calibration to temperature — 
in fact, the temperature calibration is obtained from them.) The 
timestreams require no further preprocessing. 

To improve the speed of the Cottingham algorithm, we exploit 
the fact that the map pixelization used for calculating the atmo- 
spheric signal (Equations (5)-(7)) need not be the same as the 
map-making pixelization. In general we only use a selection of 
the possible pixels on the map; additionally, we down-sample 
the number of hits in each pixel. We call the former “pixel 
down-sampling” and denote the fraction of retained pixels n p ; 
the latter we term “hit down-sampling” and denote the fraction 
of retained hits «/,. Consequently, the fraction of total available 
data used is n p x m,- Each of these down-samplings is done in an 
even manner such that all working detectors are used, and also 
such that no large gaps exist in the remaining timestream. 

We have specified four parameters for the Cottingham 
method: the knot-spacing 7>, the pixel size £, the pixel down- 
sampling fraction, n p , and the hit down-sampling fraction, n h . 
Of these, we always choose £ = 18" (about 1/3 the 277 GHz 
beam size). 

To evaluate the effect of varying the other three variables, we 
define a figure of merit which compares the average detector 
spectral density below 1 Hz to the white noise level, calculated 
in the range 5-25 Hz: 


L 



flHz ,, / f 25Hz ,, ’ 

Jo Hz d f / Js Hz d t J 


HD 


where the sum runs over the Nd detectors used for the Cotting- 
ham calculation and G, is the spectral density of the /th detector 
after removing the estimated atmosphere signal. Figure 2 shows 
a plot of measured values of L for a selection of knot-spacings 
and pixel down-samplings. In our tests, we independently varied 
n p and «/,, and found that the product n p x «/,, used in Figure 2, 
captures the important trend in the ranges of interest. As ex- 
pected, shorter knot spacings remove more power: note however 
that only the 7> = 0.5 s and r k = 0.25 s are capable of removing 
power up to the 1 Hz for which L is defined (cf. Equation ( 1 0)). 
Because the curves flatten out as n p nh increases, at a certain 
point adding more data does not substantially improve the fit. 
This supports our conclusion that we only need to use a fraction 
of the data to estimate the atmosphere. 

The timing data in the lower panel of Figure 2 were measured 
on a 64-bit Intel Xeon® 2.5 GHz processor. Computation time 
is dominated by the calculation of the variables in Equation ( 6 ). 





ACT: BEAM PROFILES AND FIRST SZ CLUSTERS 


5 


Approx. Nutn. Data Points Per Second 



0.01 0.1 1 
Fraction of Total Data (n p np) 


Fig. 2. — Top panel: the ratio L of low-frequency power to white noise after 
removing the atmospheric signal, as defined by Equation (11). All data points 
are from the same TOD using all working detectors; only the parameters 7>, n p 
and «/, (see the text) were varied to obtain each point. The x-axis is a product 
of the resamplings tip and tip, and the colors are different knot spacings n as 
indicated in the plot key. Bottom panel: the computation time required for the 
data shown in the top panel. (The segmented lines are an artifact of how the data 
were recorded.) As the fraction n p np of total data used increases, the efficacy 
L of power removal flattens out and adding more data does not significantly 
improve the fit but only takes more computation time. In this example, there 
were 103000 possible map pixels with an average of 287 hits pixel -1 . 

In general, these go linearly with the number of data points d and 
quadratically with the number of basis functions B; in the case 
of B-splines, the compact support of the bases can be exploited 
so that the quadratic rate is subdominant to the linear, as the plot 
shows. 

The number of basis B-splines is small enough that it is actu- 
ally feasible to solve Equation (7) exactly. For most cases, how- 
ever, the conjugate gradient method (e.g., Press et al. 1992, p. 
83 ff) is much faster and yields indistinguishable results. There- 
fore, we use the latter in our implementation. 

2.6. Map-making 

Once the atmospheric model Ba has been calculated for a 
TOD with the Cottingham method, we create its celestial map. 
Maps are made in (Am A A cos (a)) coordinates, where A a and 
A A are the distances from the altitude and azimuth of the map 
center. 3 " We use a pixel size of 10."6 per side, about 20% the 
size of the 277 GHz beam; note that this is different from the 
pixel size f used for calculating the atmosphere (Section 2.5). 
In this paper, the region of sky for which the atmospheric model 
is calculated is much wider in the azimuth than in altitude, since 
the telescope scans — 4?47 along the A A cos (a) axis — are much 
wider than the distance the sky rotates along the A a axis as the 
object of interest passes through the field of view. However, 
all of the maps we present are cropped to disks with 1:1 aspect 
ratios, centered on the objects being mapped. 

We make the white noise approximation for each detector and 
weight it by the inverse of its variance in the map estimate. The 
detector variances are obtained iteratively: we make a map with 
equal detector weights and measure the variances of individual 
detector maps against the total map, remake the map with the 
new variances and repeat until the total map variance converges. 
The atmosphere estimates returned by the Cottingham method 
have arbitrary offsets, which can be different for the nine sub- 
arrays we use (see Section 2.3). Thus, in the same iterative pro- 

j3 This is a very good approximation to the Gnomonic projection for the small 

map sizes we use. 


cess, we also fit for the sub-array offsets and remove them when 
coadding detectors. 

Coaddition of TOD maps is done after all of the steps de- 
scribed above. The inverse variance of each map (calculated 
after masking bright point sources or clusters) is used as its me- 
dian weight, and the relative weights of its pixels are given by 
the number of hits per pixel. 

Finally, we mention that the software used for the results in 
this paper has a completely independent pipeline from our main 
map-making software which solves for the frill survey area cov- 
erage. It has been especially useful for studying and optimizing 
the signal extraction in small, targeted regions, and has provided 
important double-checks for our other pipeline. 

3. BEAM MAPS AND PROPERTIES 

Understanding the telescope beams, or point-spread func- 
tions, is of primary importance for the interpretation of our maps 
since they determine the relative response of the instrument to 
different scales on the sky and are central to calibration. For 
ACT’s measurement of angular power spectra, the Legendre 
transform of our measured beam profile, called a window func- 
tion, determines the response of the instrument as a function of 
angular scale. 

Planets are excellent sources for measuring the telescope’s 
beam because they are nearly point sources and are brighter than 
almost any other celestial object. The best candidates for ACT 
are Saturn and Mars; of the rest, Jupiter is too bright and satu- 
rates the detectors, Venus is available too near to sunrise or sun- 
set when the telescope is thermally settling, and the others are 
too dim for exploring the far sidelobes of the beam. (However, 
Uranus is useful as a calibrator — see Section 2.5.) The beam 
maps presented in this section are from observations of Saturn, 
which was available from early November through December 
of 2008. The possible concern of detector non-linearities when 
observing Saturn is obviated by the fact that atmospheric fluctu- 
ations, which are of the same order of magnitude in brightness 
as Saturn, are found to produce linear detector responses. 

3.1. Data Reduction 

Maps were made for each night-time TOD of Saturn, using 
the Cottingham method with t> = Is, n p = 0.32 and «/, = 0.36. 
We had more TODs than were needed to make low-noise beam 
maps, and we excluded about 1/3 of the maps which had higher 
residual background contamination, manifested by beam pro- 
files that significantly diverged from those of the cleaner maps. 
The map sizes and number of TODs per frequency band are 
shown in Table 1 . 

In the analysis of Section 3 . 2 , below, we sometimes compare 
unprocessed maps to stripe-removed maps. Stripe-removed map 
have been treated as outlined in Section 2 . 4 , while unprocessed 
maps have not been altered after map-making except for the sub- 
traction of a background level. The planet is masked out before 
calculating the mean map value which estimates this level. The 
mask sizes for the three arrays, whether used for stripe removal 
or background estimation, are listed in Table 1. 

In each frequency band, the selected TOD beam maps were 
coadded. Weights were determined from the rms of the mean 
background level, calculated outside the mask radius. Relative 
pointing of individual detectors was measured to sub-arcsecond 
precision using the ensemble of Saturn observations. The over- 
all telescope pointing was determined from each planet obser- 
vation prior to map-making and used to center each TOD map, 
so recentering of the maps was unnecessary before coaddition . 




6 


HINCKS ET AL. 



FIG. 3. — Beam maps for the three frequency bands: from left to right, 148 GHz, 21 8 GHz and 277 GHz. The maps are from coadded observations on 1 1 -15 nights 
(see Table 1) and have radii of 21' (148 GHz) and 15' (218 GHz and 277 GHz). Maps are normalized to unity and contours are in decrements of-lOdB. A histogram 
equalized color scale is used to highlight the fact that we have made < -40dB beam measurements of our 148 GHz and 218GHz bands. (Negative values are due 
to noise about the mean background level.) Even in the inferior 277 GHz map, striping is still below -20dB. The disks above the color bar show the sizes of the 
beam FWHM for each band (see Table 1). A Gaussian smoothing kernel W'ith a = 0154 has been applied to highlight large-scale structure; smoothing is not otherwise 
performed in the analysis. No stripe removal has been done on these maps. 


3.2. Beam Measurements 

Figure 3 shows coadded beam maps for the three arrays us- 
ing a color scale which highlights the features in the sidelobes. 
The 148 GHz and 2 18 GHz maps have striking similarities, most 
notably along the altitude (or vertical) direction where both ex- 
hibit more power near the top of the map. This is due to the 
off-axis design of the telescope (see Fowler et al. 2007): since 
the 148 GHz and 2 18 GHz arrays sit at about the same vertical 
offset from the center of the focal plane, their resemblance along 
this axis is expected. Note that we recover structure in these map 
at a < -40 dB level. The 277 GHz map is clearly inferior, show- 
ing residual striping in the scan direction, although this occurs 
below -20 dB. We believe that this is from a combination of the 
brighter atmosphere at 277 GHz, as well as detector noise corre- 
lation induced by large optical loads (such as Saturn), which we 
are still investigating. Nonetheless we are still able to measure 
the 277 GHz solid angle to about 6% (see below), and work is 
underway to improve it. 

For the 148 GHz and 218 GHz arrays, we do our beam anal- 
ysis on maps which have not had stripe removal because this 
process removes the real vertical gradient from the maps (see 
Figure 3). Nonetheless, the solid angles (see below) from stripe- 
removed maps are within la of the values from unprocessed 
maps. On the other hand, the larger residual striping in the 
277 GHz maps necessitates the use of stripe-removed maps. 

The beam center is characterized by fitting an elliptical Airy 
pattern — the function describing the beam of an optical system 
with a perfect aperture — to the top ~3 dB of the beam map. This 
provides a measurement of the location of the beam center, its 
FWHM along the major and minor axes of the ellipse, and its 
orientation, which we define as the angle of the major axis from 
the line of zero altitude relative to the beam center. The uncer- 
tainties in these parameters are determined using the bootstrap 
method (Press et al. 1992, pp. 691ff) and give errors consistent 
with the standard deviation of values measured from the indi- 
vidual TOD maps. The FWHM and angles are listed in Table 1 . 
They are included for reference but are not used in any analysis. 

We denote the beam map by B(0,d>), where we use coordi- 
nates with radial distance 0 from the beam center and polar an- 


TABLE 1 

Summary of Beam Parameters 



148 GHz 

218GHz 

277 GHz 

Map properties (Section 3.1) 



No. of TODs 

16 

15 

1 1 

Stripe removal? 

no 

no 

yes 

Map radius (') 

21 

15 

15 

Mask radius (')“ 

18 

9.11,13 

6,8,10.12 

Beam centers (Section 3.2) 




Major FWHM (') 1 .406 ± 0.003 

1.006 ±0.01 

0.94 ±0.02 

Minor FWHM (') 1 .344 ± 0.002 

1.001 ±0.003 

0.88 ±0.02 

Axis angle (°) 

62 ±2 

1 37 ± 9 

98 ± 13 

$ir wing fits (Section 3.2) 




Fit start, B\ (') 

7 

5 

4.5 

Fit end, 0 2 (') b 

13 

7-11 

6-10 

Best-fit On- (') 0.526 ±0.002 

0.397 ±0.01 

0.46 ±0.04 

Solid angles (Section 3.2) 




Solid angle (nsr) 

218.2 ±4 

1 18.2 ±3 

104.2 ±6 

Percent interpolated 

2.8 

4.3 

7.2 

Beam fits (Section 4.1) 




0o (' ) 

0.2137 

0.1562 

0.1367 


Note. — See the text for definitions of these parameters and how they 
are measured. Values for the 148 GHz and 21 8 GHz bands are obtained 
from the coadded, unprocessed maps whereas the 277 GHz values are av- 
erage values from stripe-removed maps with the mask sizes indicated in 
the table. 

a The 218 GHz and 277 GHz beam properties are averaged from the re- 
sults at these mask radii — see the text. 

b The fit ranges for the 2 1 8 GHz and 277 GHz band are varied along with 
the mask radii so that O 2 is never larger than the mask — see text. 


glc d>. By definition, B( 0,0) ~ 1. The symmetrized beam is 
averaged around the polar angle: 


( 12 ) 
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Another quantity of interest is the accumulated solid angle, 
which measures the total normalized power within a given ra- 
dius: 

2tt 9 

0.(6) = j d<j>' J O'dO'BiO', <p'). (13) 

o o 

The beam solid angle is Oa = 0(9 = ir). 

Figure 4 shows measured beam profiles and accumulated 
solid angles for the three arrays. We measure the beam profiles 
down to about -45 dB. If the beams exactly followed an Airy 
pattern, these data would account for 98% of the solid angle. 
Since systematic effects could corrupt our maps at the largest 
radii, we seek a way to robustly estimate the last few percent of 
the solid angle on each beam. The method is to extrapolate the 
data with a fit to the asymptotic expression for the Airy pattern: 

b\o»e F )=(^j , (14) 

where 0 F is the beam FWHM and 0 W defines the wing scale. 
Equation (14) is good to better than 1% beyond about 5 By 
(Schroeder 2000, Section 10.2b). ’ 4 Knowledge of 9w allows 
us to infer the amount of unaccounted solid angle beyond the 
map boundary. A simple integration shows that the solid angle 
beyond a radius 9 * is: 

Off (6 > 9h) = ( 15 ) 

Mb 

We can also use this expression to estimate the amount of true 
beam power which was “mistakenly” included in the measure- 
ment of the background level outside the mask radius and sub- 
tracted from the map. In our analysis of the beam profiles and 
solid angles (including those displayed in Figure 4), we use the 
fits of 9w to calculate this missing power and add it back into 
the map. A new 9w is then calculated from the corrected map; 
after two such iterations the 9w fit converges. 

Figure 4 includes over-plots of the wing estimates from the 
best-fit values of (hv on unprocessed maps. We denote the radii 
between which the fits were performed as 0 \- 8 i, and choose 
9\ « 59 p for each array. For 148GFIz, we obtain good fits for 
any choice of 62 up to 13', or about the -40 dB level in the pro- 
file. Thus, we use 62 = 1 3', for which we fit with y 2 of 40 for 35 
degrees of freedom. The fits to the other profiles are not as ro- 
bust: 218 GHz has a reduced-^ 2 of 2.8 for 62 = 1' and 277 GHz 
has reduced-\ 2 of25 for 62 =6'. Larger $2 gave poorer fits. Con- 
sequently, for these profiles we calculate 8 w at different mask 
sizes, as indicated in Table 1 . At each mask size we varied 62 in 
2' increments, always keeping it lower than the mask size. The 
average value from the whole ensemble of fits gives us Qw and 
we take its standard error as the uncertainty. Although Equa- 
tion (14) may be too simple a model for these profiles, contribu- 
tions to the solid angle at these radii are only a few percent of 
the total solid angle, which has an uncertainty dominated by the 
contribution of the beam at radii less than 82 — see below. The 
values of 0 H - for all three beam profiles are listed in Table 1. 

Our Gw fits allow us to calculate precise solid angles. At radii 
smaller than 62 , we integrate the normalized power in the map 
(cf. Equation ( 1 3)). Beyond 9 2 , we use Equation (1 5) to ex- 
trapolate the remaining solid angle. (In the case of the 2 1 8 GHz 

’ 4 In full generality Equation (14) is also proportional to cos(7uDf?/A-37r /4), 
where D is the telescope aperture diameter and A is the wavelength; we have 
smoothed over cosine cycles. 


and 277 GHz solid angles, we use the smallest 9i and the largest 
mask size in the ranges shown in Table 1 . Other choices from 
these ranges do not significantly alter the results.) Finally, in the 
approximation that Saturn is a solid disk, it adds half of its solid 
angle Os to the measured instrument solid angle — this is shown 
in Appendix B. Thus, the total solid angle is: 

0 A = 0(9 < 62 ) + O w (9 > 62 ) - O s /2. (16) 

During the period of our observations, Saturn subtended solid 
angles from 5.2 to 6.0 nanosteradians (nsr). We use the mean 
value of 5.6 nsr. 

Determining the rest of the uncertainty in the solid angle is not 
straightforward since systematic errors dominate. For our total 
error, we add the estimated uncertainties of each of the terms on 
the right-hand side of Equation (16) in quadrature. The uncer- 
tainty from Saturn’s solid angle we take to be 1 nsr, both because 
of its varying angular size and to account for any systematic er- 
ror due to the disk approximation. The uncertainty of O w is 
derived from the error of the fitted 6 \v - For 0(6 < 82 ), which 
dominates, we estimate the error by looking at the distribution 
of values from the individual TOD maps which comprise the 
coadded map. We did this in two ways. First, we calculated 
the mean and standard deviation of the solid angles measured in 
each individual map. This also reassures us that the coaddition 
step does not introduce any systematic error through, for exam- 
ple, pointing misalignments or changes in telescope focus from 
night to night. Second, we used the bootstrap method to gener- 
ate 1 000 coadded maps with random subsets of individual maps 
and used this ensemble to estimate the 68th percentile (i.e., 1 a) 
of solid angles. These two error estimates were consistent with 
each other. 

The solid angles and their uncertainties are reported in Ta- 
ble 1 . The formal uncertainties have been doubled and we quote 
them as 1 a, in case there are systematic effects for which we 
have not accounted. In particular, the maps used for power spec- 
trum estimation will come from an independent pipeline and 
will treat the instrumental response in slightly different ways — 
for example, by weighting detectors differently. We expect the 
beam uncertainties to decrease as our analysis evolves. 

4. WINDOW FUNCTIONS 

The statistics of the CMB are frequently characterized by an 
angular power spectrum Cf 

A T(ri) / blffy . ) - (12) 

£,m 

where A T(h) is the CMB temperature at position h and Yf m is 
a spherical harmonic. In spherical harmonic space, the beam is 
encoded in a window function w e describing the response of the 
experiment to different multipoles t, such that the total variance 
of a noiseless power spectrum is: 

Var=y (18) 

e 

In the case of a symmetric beam, the window function is 
the square of the Legendre transform of the beam radial profile 
(White & Srednicki 1995; Bond 1996): 

35 The rings of Saturn add a layer of complication to its solid angle calcula- 
tion, particularly since they have a different temperature than the disk. The ring 
inclination was low during our observations (< 6°) and we have estimated that 
their contribution is negligible within the error budget. 
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Fig. 4. — The beam profiles (Equation (12)) and accumulated solid angles (Equation (13)) for the three arrays, calculated from coadded maps (see text). The beam 
profiles are shown for both unprocessed maps, with dark errorbars, and stripe-removed maps, with light errorbars. Over-plotted on each profile is the best fit of 0 \\ 
(Equation (14)) to the unprocessed beam profiles. The error on the profiles are standard errors from the azimuthal average. The accumulated solid angles are from the 
unprocessed maps (without any solid angle extrapolation via Equation (15)) for 148 GHz and 2 18 GHz, and from the stripe-removed map for 277 GHz. Saturn is bright 
enough that the rms power from the CMB falls below all the points in these plots. 


w £ = b\, b s (0)P e (O)d(cos6). (19) 

4. 1 . Basis Functions 

For calculation of the window function and its covariance we 
model each beam with a set of basis functions which is complete 
but not necessarily orthogonal: 

w max 

b s (e)=J2 a >M0)- ( 20 ) 

n - 0 

Because the beam is truncated by a cold Lyot stop 
(Fowler et al. 2007), its Fourier transform is compact on a disk, 
which suggests that a natural basis with which to decompose 
the Fourier transform of the beam image is the set of Zemike 
polynomials that form an orthonormal basis on the unit disk 
(Bom & Wolf 1999). The Zemike polynomials, expressed in 
polar coordinates p and ip on the aperture plane, are: 

V n m (p, < p) = RZ(p)e im *, (21) 


as our set of basis functions to fit the radial beam profile. ,6 Here, 
we have introduced a fitting parameter, 9 0 , to control the scale 
of the basis functions. 

4.2. Fitting Basis Functions to the Beam Profile 

Below 6\ (cf. Table 1), we fit the bases b n of Equation (24) 
to the measured beam profile, and beyond 9\, we use the power 
law defined in Equation ( 1 4) with the parameters 9w listed in 
Table 1 . We assume vanishing covariance between the power 
law and the basis functions as they are fitted to independent sets 
of data points. 

We employ a nonlinear, least-squares method to solve for the 
coefficients a„ and their covariance matrix C a fi„ . The algorithm 
uses a singular value decomposition to determine if the basis 
functions accurately characterize the data and also computes 
a goodness-of-fit statistic (Press etal. 1992, Section 15.4). As 
inputs to the fitting procedure we are required to specify the 
scale parameter, Or,, and polynomial order, « max . We searched 
the {#o-«max} parameter space until a reasonable fit was ob- 
tained that kept n mm as small as possible. For all three bands, 
«max = 1 3 gives a reduced x 2 «l. No singular values is found 
for any of the fits. The parameters Op, we use for each frequency 
band are listed in Table 1. 


where m and n are integers such that n > 0, n > \m\ and n- 
\m\ is even. In the case of an azimuthally symmetric beam, we 
need only consider the m = 0 radial polynomials, which can be 
expressed in terms of Legendre polynomials, ,P„(.t), as follows: 


4.3. Window Functions and Their Covariances 
Given the amplitudes a„ of the radial beam profile fitted to 
the basis functions and the covariance matrix C“" between the 
amplitudes a m and a„, the beam Legendre transform is: 


R° 2n (p) = Pn(2p 2 -l). (22) 

The radial Zemike polynomials have a convenient analytic form 
for their Fourier transform: 


SU*)-j pdpe lo() lt\ n {p) = (-1 )"J 2 „+\(0)/d, (23) 


where,/,, is a Bessel function of the first kind. Motivated by this, 
we adopt 


b,m= 



fiti+X 



(24) 


be = 'Y^ a „bt n . (25) 

«= o 

and the covariance matrix of the beam Legendre transforms b £ 
and bp/ is: 


yb = SF dfrf r „ a ' <>h, 

^ da m mn da„ 

m.n - 0 

36 It may be asked why the Airy pattern, which was somewhat suitable for the 
high-0 fit in Section 3.2, is not used here. We find that at low 6 , it is a poor fit 
since the optics are more complicated than the perfect-aperture model assumed 
by (he Airy pattern. 
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Fig. 5.- — Normalized window functions (top) and diagonal errors (bottom) 
computed from the basis functions for each of the three frequency bands. The 
window functions have been normalized to unity at l = 0. In practice, the nor- 
malization will take place over the range of multipoles corresponding to the best 
calibration. Only statistical errors are shown. 

and therefore the covariance for the window function is: 

(27) 

In Figure 5 we show the window functions for each of the 
three frequency bands with diagonal error bars taken from the 
covariance matrix, . We observe that the window function 
for each of the frequency bands has fallen to less than 1 5% of 
its maximum value at t = 10000. The statistical diagonal errors 
are at the 1 .5%, 1 .5%, and 6% levels for the 1 48 GHz, 2 1 8 GHz, 
and 277 GHz bands respectively, as shown in Figure 5 . They are 
computed following Equation (17) of Page et al. (2003). The 
off-diagonal terms in the beam covariance matrix are compa- 
rable in magnitude to the diagonal terms. Singular value de- 
compositions of YJ\ ( , yield only a handful of modes with singu- 
lar values larger than 10“ 3 of the maximum values: five modes 
for 148 GHz, four for 218 GHz, and two for 277 GHz. Thus, 
the window function covariances can be expressed in a compact 
form which will be convenient for power spectrum analyses. 

For 277 GHz we estimate a 10% systematic uncertainty from 
destriping. Another source of systematic error in the window 
functions arises from the beams not being perfectly symmetric. 
The symmetrized beam window function generally underesti- 
mates the power in the beam (e.g., Figure 23, Hinshaw et al. 
2007). In practice, the scans in our survey field are cross-linked 
(see Section 5 . 1 ). Thus, given the spherical transform of a cross- 
linked beam map bi, the window function of relevance for the 
power spectrum calculation is: 

We = hT b Wb*(t)dfr, (28) 

where <t>i is the polar angle in spherical harmonic space. We 
computed the fractional difference between the window func- 
tion derived from the Legendre transform of the symmetrized 
beam (shown in Figure 5) to that derived from an estimated two- 
dimensional, cross-linked beam map. For the latter, we rotated 
our Saturn beam map by a typical cross-linking angle of 60°, 
coadded it with the original, and evaluated Equation ( 28 ) on the 
transform of this synthetic map. The difference between the two 
was found to be at the 1% level for the 148GHz and 218GHz 
arrays, and at the 4% level for the 277 GHz array. 

5. SZ GALAXY CLUSTERS 

In addition to beam maps, the Cottingham method mapper has 
been used for making maps of SZ clusters. The maps and anal- 
ysis presented in this section are the first results from ACT on 


SZ science. For this first overview, we include results from only 
the 148 GHz band, the most sensitive during our 2008 season. 
Of the five clusters presented in this paper, all of which were 
previously known, three are detected for the first time with the 
SZ effect. 

5.1. Data 

Table 2 lists the clusters studied in this paper, including infor- 
mation on the maps and a summary of the results of our analysis 
(Section 5 , 2 , Section 5 . 3 ). Figure 6 shows the cluster maps and 
companion difference maps (see below). 

Apart from planets, ACT has done no targeted observations 
of specific objects, so the cluster maps come from our regular 
survey data, which were taken at two different central azimuth 
pointings, one on the rising sky and the other on the setting sky. 
Therefore, the maps presented here are “cross-linked”, i.e., they 
consist of data taken with two distinct angles between the az- 
imuthal scan direction and the hour angle axis. The integration 
time is short, ranging from about 3 to 1 1 minutes — see Table 2. 

The clusters were found in a full-survey 148 GHz map pro- 
duced by our main map-maker. A Wiener filter was constructed 
using the polytropic model of Komatsu & Seljak (2001) as an 
SZ template, and included detector noise, CMB power and point 
source contributions in the noise model. Clusters were then 
identified from the filtered maps. We make two points about 
the clusters presented in this paper: first, although our template- 
based detection method has some built-in bias, the detections 
presented here are significant (> 5cr in the filtered survey map); 
and second, we have only included a sample of our significant 
detections. 

Cluster maps are made using the procedures outlined in Sec- 
tion 2.5 and Section 2 . 6 . The knot spacing was 7 > = 0.5s and 
the downsampling fractions were n p « 0.42 and «/, « 0.40. All 
maps are 0.°4 in diameter. Straight-line stripe removal (Sec- 
tion 2 , 6 ) has been performed, using a 6' radius mask over the 
cluster decrement. 

We have made companion “difference” maps for each cluster 
from the same data. For each of the rising and setting observa- 
tions, a map made from the first half of the season’s data is sub- 
tracted from the second half. The rising and setting difference 
maps are then coadded to produce the full, cross-linked coadded 
difference map — the same procedure used for the signal maps. 

The map noise, listed in Table 2 , is the rms of the map com- 
puted outside a 6' mask and converted to an effective pixel size 
of 1 aremin 2 . By examining the power spectra of the maps we 
found that the rms values we quote are dominated by the white 
noise level and do not have significant contributions from resid- 
ual low-frequency power. 

5.2. Cluster Profiles 

The cluster center positions are determined by finding the 
coldest point in the map smoothed with a 2' FWHM Gaussian 
kernel. As a rough guide, we also quote the cluster depths, 
A7sz, from these smoothed maps in Table 2 , but we stress that 
these values should not be used for quantitative analysis. All 
other cluster properties are measured from unsmoothed maps. 

Figure 7 shows the clusters’ radial temperature profiles, calcu- 
lated by finding the mean map temperatures in 32" wide annuli 
about the cluster centers listed in Table 2 . The errorbars on the 
profiles plotted in Figure 7 are the standard errors of these mean 
values. 

To highlight the size of the SZ decrements, 24 additional pro- 
files are included in Figure 7 , each from a map of a patch of 
the sky containing no clusters and processed in the same way 
as the cluster maps. Furthermore, to illustrate that the depth 
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of the cluster profiles is much greater than the fluctuations in 
the primary CMB anisotropies, we created a large ensemble 
( 1 000) of noise- free, primary-anisotropy CMB simulations us- 
ing a flat ACDM cosmology. These synthetic maps were made 
at the same size and resolution as our signal maps, and under- 
went the same stripe removal process. Figure 7 shows shaded 
areas which contain 68% and 95% of the simulated CMB radial 
profiles. The cluster profiles fall significantly below both the 
simulated CMB-only profiles and the profiles from blank CMB 
patches. 

As a check that the choice of knot spacing ( Tk = 0.5) is 
not creating a significant bias via covariance of the celestial 
signal with the low-frequency atmospheric estimate (see Sec- 
tion 2.4), we created maps with Tk from 0. 1 5 s to 1.5 s for ACT- 
CL J0245-5301 and ACT-CL J0638-5358. The temperature 
profiles for the latter are plotted in the middle panel of Figure 8. 
Even the shortest spacing does not produce a profile which is 
significantly different from the others. We conclude that the re- 
sults are not biased by having knots of too high a frequency. 

5.3. Integrated Compton-y Values 

The SZ effect occurs when CMB photons inverse 
Compton-scatter off hot electrons in clusters of galaxies 
(Zeldovich & Sunyaev 1969; Sunyaev & Zel’dovich 1970). 
The imprint on the CMB is proportional to the integrated 
electron gas pressure: 

jr~-~ = vf(x); y=^% f dln e T ek (29) 

Tcmb m e c- J 

where the integral is along the line of sight, m e , n e , and T e are the 
electron mass, number density, and temperature, respectively, 
( tt is the Thomson scattering cross-section, and the variable y 
is the Compton-y parameter. The function f(x) encodes the de- 
pendence on frequency: 

f(x) = [xcothCr/2) - 4] [ 1 + S sz (x. 7),)] , (30) 

with x = his/kgTcMB- The relativistic term S$z becomes impor- 
tant at higher temperatures (Rephaeli 1995), and is taken into 
account in our measurements below. 

A robust measure of the SZ signal is the integrated Com pton-y 
parameter, since it is model-independent and simply sums pixels 
in the maps (Verde et al. 2002; Benson et al. 2004): 


1 E 0657-56 (Bullet) Difference 
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Fig. 6. — Cluster maps made using the Cottingham method at 148 GHz. paired 
with their difference maps (see Section 5.1). The coordinates are J2000 right 
ascension (hours) and declination (degrees). The color bars are yK (CMB); 
note that the scale is different for each cluster. The gray disk in the top comer of 
the the signal plots is 2(43 in diameter, the FWHM size of the beam convolved 
with the Gaussian smoothing kernel which was applied to these images. In 
each difference plot, a cross shows the coordinates of the darkest spot in its 
corresponding signal map. 


m = JJ dflg'v(0'), (31) 

\0’\<e 

where 6 is the angular distance from the cluster center. We use 
steradians as the unit of solid angle, so Y is dimensionless. As 
an example, it is plotted for ACT-CL J0638-5358 in the lower 
panel of Figure 8. The values of Y at 2', 4', and 6' are shown for 
each cluster in Table 2. 

For clusters with measured temperatures (cf. Table 3), our 
Y values include the relativistic correction using the formulae 
from Nozawaetal. (2006). At 148 GHz, the corrections in- 
crease Y and, for the clusters in this paper, range from 4% (ACT- 
CL J0330-5228) to 7% (ACT-CL J0658-5556). Thus, the Y 
value quoted for ACT-CL .10546-5346, for which there is no 
measured temperature, is biased low, though we note that the 
relativistic corrections for the other clusters are smaller than the 
uncertainty of our measurements (see below). 

There are two statistical sources of uncertainty in the mea- 
sured Y values: instrumental and atmospheric noise in the map, 
and confusion of primary CMB anisotropies with the SZ signal. 
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Fig. 7. — Radial cluster profiles, calculated by finding the mean temperature in the maps in 32" wide annuli. For comparison, the radial profiles for the 24 maps of 
random patches of the sky, not containing clusters, are included; additionally, shaded areas show the range containing 68% and 95% of profiles from an ensemble of 
1000 simulated, noise-free CMB patches (cf. Section 5.3). The locations of the profile centers are listed in Table 2. The 1(4 FWHM beam has not been deconvolved 
from any of the profiles. 


TABLE 2 

Selection of SZ Clusters Detected by ACT 


ACT Descriptor 

Catalog Name 

J2000 Coordinates 3 

rms b 

(MK) 

/. C 
unt 

(min) 

A T sz d 
(UK) 

10 10 x Y(8) e 

RA 

Dec. 


G<2‘ 

(±0.2) 

0< 4' 
(+0.6) 

0<6' 

(±1.1) 

ACT-CL J0245-5301 

AS0295 

02 b 45 m 28 s 

-53 c 01'36" 

44 

10.1 

-250 

0.93 

2.5 

4.1 

ACT-CL J0330-5228 

A3 128 (NE) 

03 h 30 m 50 s 

-52°28'38" 

49 

10.3 

-260 

0.97 

2.8 

4.5 

ACT-CL J0546-5346 

SPT-CL 0547-5345 

05 h 46 m 35 s 

— 53°46 / 04" 

46 

9.5 

-250 

0.96 

2.5 

3.9 

ACT-CL J0638-5358 

AS0592 

06 h 38 m 46 s 

-53°58'40" 

55 

7.5 

-230 

0.74 

1.5 

2.2 

ACT-CL J0658-5556 

IE 0657-56 (Bullet) 

06 h 58 m 33 s 

-55° 56'49" 

80 

3.4 

-510 

1.70 

3.20 

3.80 


a Position of the deepest point in 2' FWHM Gaussian smoothed map. 
b Map rms measured outside a 6' mask and reported for a 1 arcmin 2 area. 

c Integration time, defined as the approximate total time (in minutes) that the telescope was pointed in the map region. 
d Cluster depth, as measured in a 2' FWHM Gaussian smoothed map at the listed coordinates; intended as a guide to the magnitude of 
the decrement. 

e See Equation (31) and following discussion. 


The contribution from noise is readily estimated from the map 
rms (cf. Sec. 5.1 and Table 2). To estimate the second contri- 
bution, we found the standard deviation of Y in the ensemble 
of simulated, noise- free, primary CMB-only maps described in 
Section 5.2. This latter source dominates over the error from the 
map noise. Our estimated 1 <t errors, including both sources of 
uncertainty, are 0.2xl0~ 10 , 0.6xl0 -I ° and l.lxKT 10 for Y at 2', 
4', and 6'. We arrive at uncertainties that agree to better than 
10% by calculating the standard deviation of Y values from the 
24 maps of random patches of the sky with no clusters present 
(cf. Section 5.2). 

Uncertainty in the cluster temperature enters into the measure- 
ment of Y via the relativistic correction (see above). However, 
the error introduced is less than a percent and is therefore in- 
significant in comparison to the contributions from noise and 
primary CMB anisotropies. 

5.4. Comparisons to Previous Measurements 

The clusters shown in this paper are previously known X-ray, 
optical, and/or SZ clusters; all are massive systems. For three of 
the sources (AS0295, A3 128 (NE), and AS0592), these are the 
first reported SZ detections. In this section, we briefly review 
measurements from the literature to provide context, and point 


out some of the contributions that our new measurements make 
to this body of knowledge. 

Relevant parameters from the literature are listed in Table 3; 
references for these values are included below. Typical errors on 
Lx are small (< 20%), while those on the inferred mass are more 
substantial (~50%). Temperatures are measured values from 
X-ray spectra. We use a flat ACDM cosmology with flu = 0.3 
and Ho = 70kms~' Mpc -1 . Masses are quoted in units of Msoo, 
defined as the mass within a radius having a mean mass density 
(p) 500 times greater than the critical density, i.e., (p) = 500 x 
3H 2 /(SttG). In the following we briefly discuss the clusters in 
the order in which they appear in Table 2. 

5.4.1. AS0295 

AS0295 first appeared in Abell et al. (1989) in their table of 
supplementary southern clusters (i.e., clusters that were not rich 
enough or were too distant to satisfy the criteria for inclusion 
in the rich nearby cluster catalog). It was also found to be a 
significant X-ray source in the ROSAT All Sky Survey (RASS) 
Bright Source Catalog (Voges et al. 1999). The spectroscopic 
redshif! of AS0295 was obtained by Edgeetal. (1994), who 
also reported the discovery of a giant strong-Iensing arc near the 
brightest cluster galaxy. Efforts to detect the SZ effect at 1 .2 mm 
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TABLE 3 

SUMMARY OF CLUSTER PROPERTIES FROM X-R.4Y AND OPTICAL STUDIES 


ACT Descriptor 

Catalog Name 

Redshift 

Da 

ix(0. 1-2.4 keV) 

A /500 

kT 

10 10 X r 25 oo a 




[Mpc] 

( 1 0 44 ergs -1 ) 

(10 15 a/©) 

(keV) 


ACT-CL J0245-5301 

AS0295 

0.3006 

920 

8.3 

0.8 

6.7±0.7 


ACT-CL J0330-5228 

A3 128 (NE) 

0.44 

1172 

3.9 

0.3 

5.1 ±0.2 

0 . 15 JI 

ACT-CL J 0546-5346 

SPT 0547-5345 

0.88 (P) 

1596 

4.7 

0.6 

— 


ACT-CL J0638-5358 

AS0592 

0.2216 

737 

10.6 

1.0 

8.0 ±0.4 

131L8.II 

ACT-CL J0658-5556 

IE 0657-56 

0.296 

910 

20.5 

1.4 

1 0.6 d= 0. 1 


NOTE. — See Section 5.4 for citations to the literature from which these values were obtained. The marker (P) in the redshift 
column indicates a photometric redshift measurement. 


a Predicted value of Y within R 2500 from the Y -kT scaling relation of Bonamente et al. (2008). Errors come from the uncertainty 
on the scaling relation parameters. Although we do not have Risoo values for our clusters, the T( 2') measurements listed in 
Table 2 should be roughly comparable to these — see Section 5 4.6. 



0 2 4 6 8 10 

Radius (arcmin) 

Fig. 8.- - The radial profile (top/middle) and integrated Compton Y(8) values 
(bottom) for the SZ decrement of ACT-CL J0638-5358 (AS0592). The pro- 
file data are averages from the maps in 32" wide annuli, and Y(6) is the sum 
of the pixels within a radius 8, converted to the unitless Compton-y parameter 
(Equations (29) and (31)). The top panel shows the profile of the signal map 
and difference map. The middle panel compares profiles for maps made with 
different knot spacings 7 >, showing that the choice of knot spacing does not 
significantly bias the measured cluster profile. In all of the panels, the profile 
centers were determined by the minimum of the map after smoothing with a 2' 
FWHM Gaussian profile. (The profiles were calculated from the unsmoothed 
maps.) 

and 2 mm with the Swedish-ESO Submillimetre Telescope were 
attempted, unsuccessfully, by Andreani et al. (1996). ASCA ob- 
servations (Fukazawa et al. 2004) yielded values (see Table 3) 
for average temperature and soft band X-ray flux(0. 1-2.4 keV), 
from which we determined the corresponding X-ray luminosity. 
The cluster mass M 500 was then estimated from the luminosity- 
mass (specifically Zy(0. 1-2.4 keV) versus M 500 ) relations from 
Reiprich & Bohringer(2002). 

5 . 4 . 2 . A3128 (NE) 

Until quite recently the north-east (NE) component of A3 1 28 
was believed to be part of the Horologium-Reticulum superclus- 
ter at z = 0.06. The X-ray morphology is clearly double peaked 
with the two components separated on the sky by some 12 '. 
Rose et al. (2002) estimated the virial masses of the two com- 
ponents assuming the redshift of the supercluster and obtained 
a value for each of ~1.5 x lO 14 A/ 0 . Figure 9 shows our SZ 
measurement with overlaid X-ray contours. 


Recently Werner et al. (2007) carried out a detailed study of 
this cluster using XMM-Newton data, which revealed a more 
distant and more massive cluster superposed on the northeast- 
ern component of A3 128. A significant portion of the X-ray 
emission comes from this background cluster. The values we 
quote in the table for redshift. X-ray luminosity, gas tempera- 
ture, and ATsoo correspond to the background cluster and come 
from Werner et al. (2007). 

The large SZ decrement seen in the ACT maps is clearly as- 
sociated with the NE component where the z = 0.44 cluster is. 
We do not detect a significant decrement from the southwestern 
component which lies at z = 0.06. Werner et al. (2007) estimate 
the temperature of the higher redshift cluster to be 5. 14 ± 0. 15 
keV, which is significantly hotter than that of the foreground 
cluster (kT = 3.36 ±0.04 keV). This system, therefore, is a 
compelling illustration of the mass selection, approximately in- 
dependent of redshift, of the SZ effect. Werner et al. (2007) 
note that the temperature, luminosity, and mass estimates of the 
z = 0.44 background cluster are all subject to large systematic 
errors, as the cluster properties depend upon the assumed prop- 
erties of the foreground system. A joint X-ray/SZ/optical analy- 
sis should be able to better constrain the characteristics of both 
systems and thereby contribute to assessing the mass threshold 
of the ACT cluster survey. 

5 . 4 . 3 . SPT 0547-5345 

The galaxy cluster SPT 0547-5345 was first discovered via 
the SZ effect by the South Pole Telescope (Staniszewski et al. 
2009). Its physical properties — photometric redshift, lumi- 
nosity, temperature and mass estimate — were subsequently re- 
ported by Menanteau & Hughes (2009) based on optical and X- 
ray data. 

SPT 0547-5345 has associated central elliptical galaxies with 
luminosities consistent with those of clusters in the Sloan Dig- 
ital Sky Survey. Its mass estimate from the optical and X-ray 
luminosity also suggests that it is a fairly massive system. In 
Table 3 we list its M 500 estimates from the X-ray luminosity. 

5 . 4 . 4 . AS0592 

The galaxy cluster AS0592 was originally detected optically 
(Abell etal. 1989). ROSAT detected it as a bright source in 
the All Sky Survey and its redshift (z= 0.2216) was reported 
in de Grandi et al. (1999). The cluster is also known by its 
REFLEX designation of RXC J0638. 7-5358 (Bohringer et al. 
2004). The ROSAT flux and luminosity in the soft X-ray band 
(0.1 -2.4 keV) are 7.5 x 10 ' 12 erg cm ' 2 s ' 1 and 1.1 x 10 45 ergs 
s -1 . The X-ray spectrum of AS0592 from a Chandra observa- 
tion (Hughes et al. 2009) yields an integrated gas temperature of 
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FIG. 9. — ACT-CL J0330-5228 (A3128 (NE)) with overlaid contours of X- 
ray emission in black. The SZ detection is associated with the NE feature of 
A3 128, and confirms that it is due to a more massive, higher redshift cluster 
than that at the SW lobe — a compelling example of the redshift independent 
mass selection of the SZ effect. The X-ray data come from two separate XMM- 
Newton observations (Obs Ids 0400130101 and 0400130201) with a total expo- 
sure time of 104 ks. The two observations were mosaicked into a single image 
over the 0.2— 2.0 keV. Contour values are from 1 .25x 1 0 H to 1 .25 x I0 7 pho- 
tons ctrr 2 s _l arcsec” 2 . 

kT = 8.0 ±0.4 keV. The soft X-ray luminosity implies a cluster 
mass of M5oo= 10 15 Mq. 

5.4.5. IE 0657-56 (Bullet Cluster) 

We detect IE 0657-56, the famous “Bullet” cluster, at high 
significance with a strong central decrement and large integrated 
Y. Previous detections of the millimeter-band SZ signal from 
this cluster have been reported by ACBAR (Gomez et al. 2004) 
and APEX-SZ (Halverson et al. 2008). 

The spectroscopic redshift of IE 0657-56 was obtained by 
Tucker et al. (1998), the X-ray flux came from the Einstein Ob- 
servatory (Markevitch et al. 2002), the X-ray gas temperature 
from XMM-Newton (Zhang et al. 2006), and the cluster mass, 
M 500 , from a study by Zhang et al. (2008). 

Figure 10 shows a zoomed-in plot of our SZ map with X-ray 
contours from Chandra and lensing contours from Clowe et al. 
(2007). As expected, the SZ decrement follows the X-ray con- 
tours more closely than the lensing contours, since the collision- 
less dark matter is expected to be offset from the collisional gas 
in this merging system. 

5.4.6. Comparison with Previous SZ Measurements 

Although the large masses of the ACT-detected clusters we 
report here offer strong support for the reality of our detec- 
tions, we also compare the quoted integrated Compton-y pa- 
rameters for consistency with expectations from previous SZ 
cluster studies. For this we use the Y-kT scaling relation from 
Bonamente et al. (2008) (using values for “all clusters” from 
their Table 2). Predicted values are given in the last column 
of Table 3. The Y values in the scaling relation were integrated 
within 7?25oo, the radius where the average cluster mass density 
is 2500 times the critical density. We do not have precise i? 25 oo 
values for our clusters, but estimates of /?2500 range from about 
1' to 3', so the predicted values of T(2500) should, to first or- 
der, be roughly comparable to our T(2') values. With that pro- 
viso, the predicted and measured Y values agree to within 2a 
for all clusters except A3 128 (NE), where the cluster temper- 
ature predicts a much lower Y value than we measure. Since 
it is a complex system, it could have a larger mass than pre- 


ACT-CL 0658-5557 (IE 0657-56/Bullet) 



06h59m00s 06h58m30s 


Right Ascension - J2000 

Fig. 10. — ACT-CL J0658-5556 (Bullet Cluster) with overlaid contours of 
X-ray emission (black) and dark matter distribution (orange). The X-ray con- 
tours come from an 85ks-long Chandra observation (Obs Id 3184) and corre- 
spond to the 0.5-2.0 keV band. Contour values are 4xl(T 7 to 2xl0~ 9 pho- 
tons cm -2 s" 1 arcsec -2 . The lensing data are from Clowe et al. (2007) with con- 
tours running from k = 0. 12 to 0.39. 

viously thought. A larger and better-studied sample of ACT- 
detected clusters will be necessary before drawing conclusions 
about scaling relations. 

6. CONCLUSIONS 

In this paper we have presented beam maps and SZ cluster 
measurements from the first full season of 148 GHz data from 
ACT. They represent the first scientific results from ACT and 
demonstrate that it is poised to make important contributions to 
millimeter astronomy. 

We have described a maximum-likelihood mapping algorithm 
which uses B-splines to model atmospheric signal and to remove 
it from the data. The method has been used to make high preci- 
sion (< -40dB) beam maps, with solid angles in the 148 GHz, 
218 GHz, and 277 GHz bands of (218.2±4)nsr, (1 18.2±3)nsr, 
and ( 1 04.2 ± 6)nsr, respectively. The beam profiles and window 
functions will be important for all subsequent analyses of ACT’s 
data. 

Additionally, we have made maps showing SZ decrements 
of five previously discovered galaxy clusters. Of these, three 
are detected for the first time via the SZ effect. Our high- 
significance detection of the z = 0.44 component of A3 128, and 
our current non-detection of the low-redshift part, corroborates 
existing evidence that the further cluster is more massive. This 
is a compelling example of the redshift-independent mass selec- 
tion of the SZ effect which will be exploited in future studies of 
ACT clusters. 

A preprint of this paper included three additional clusters: 
ACT-CL J0509-5345, ACT-CL J05 16-5432, and ACT-CL 
J0645-5413. They are instead quantified in Menanteau et al. 
(2010) and Marriage et al. (2010). We omitted them here in 
order to focus on the more significant of the initial ACT cluster 
detections. 
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APPENDIX 

A. THE COTTINGHAM METHOD AS A MAXIMUM-LIKELIHOOD ESTIMATOR 
Equation ( 1 ) can be written in the matrix form. 



which has the maximum-likelihood estimator: 


m 

a 


P T 

B r 


i-i 


N -1 ( P B ) 


P 7 

B r 


N ~'d = 


(PP) (PB) 
(BP) (BB) 


where we use the shorthand notation (XY) = X 7 N 1 Y. The inverted matrix evaluates to: 


P T N~'d \ 
B r N -1 «( ) ’ 


[(PP) - (PB)(BB )“ 1 (BP)] 1 -(PPr'(PB)[(BB)-(BP)(PPr 1 (PB)] ' 

-( BB)~ 1 (BP) [(PP) - < PB)(BB) 1 (BP)] [(BB) - (BP)(PP )- ; 1 (PB)] ' 

Since this matrix is symmetric, and (XY) r = (YX), we can rewrite the lower-left component as: 


(BBr’(BP) [(PP)-(PB)(BB)“ 1 (BP)] 1 = |(PP)“ 1 (PB) [(BB)-(BP)(PP) _ 1 (PB)] ! | r 

= [(BB) - (BP)(PP) H (PB)] (BP)(PP) ! . 

Thus, the solution for the atmosphere is: 


(Al) 


(A2) 


(A3) 


(A4) 

(A5) 


a = [(BB)-(BP)(PP) _1 (PB)] ' B r N”'t/ - [(BB)-(BP)(PPr'(PB)] ' (BP)(PP)“ 1 P r N _ 1 al. (A6) 

To show that this is equivalent to the solution presented in Section 2.1, we observe that the definitions in Equations (3) and (6) of 
Section 2. 1 can be recast: 


0 = B r N ‘(l-Pn)B = (BB)-(BP)(PPr'(PB). 
0 =B r N^‘(l-Pn)f/ = [B r N _1 — (BP)(PP) _l P r N“'] d. 

This reduces Equation (A6) to: 


5 = 0” 1 0, 


(A7) 


(A8) 


which is the same as Equation (7) of Section 2. 1 
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B. A PLANET'S SOLID ANGLE CONTRIBUTION TO THE BEAM SOLID ANGLE MEASUREMENT 

Denote the instrument response with P(n) and the power emitted by the planet with Po'i'(n), where is the peak power emitted and 
Tf is a normalized distribution describing its shape. The coordinate n is a two dimensional vector describing the position on the sky, 
with n — 0 at the planet center. The measured beam map, B. is the convolution of the true beam, B. with the planet: 


P(n) _ ff dH n >B(n-n')^(rt') 

W) - ffdn„B(-n')*(n') ' 


The measured solid angle is then (cf. Equation ( 1 3)): 


(Bl) 


f/MM-n'm, 1 ) = [ff di V$(»Q] [ffdH n B(n)\ = _ 

ff d^B(-n')^(n') ff dQ.„iB(n')'^(n') ff dH n ,B(n')^(n'y 1 ' 

where in the second equality we brought the denominator outside the outer integral, and in the numerator we switched the order of 
integration and then shifted the dummy variable for the integral over B. In the last equality we recognized that the integrals in the 
numerator evaluate to the solid angles of the planet and the true instrument beam, respectively. If the planet is much smaller than the 
beam, we can expand the beam appearing the integrand of the denominator in a Taylor series: 



B(n) = 1 + VR(0) ■ n + ]-n ■ H(0) •« + •••, (B3) 

where H is the Hessian matrix of the beam. At the beam center, being the peak, the gradient vanishes. If we assume a symmetric 
beam, then n • H n = ( V 2 5/2)|«| 2 . and we can write: 

q ~ 

A ~ ffdH„, [l + 1 V 2 S(0)|n'p] tf(n') 

where [if is the second raw moment of the planet shape Tl In the small planet approximation we are making, the second term in the 
brackets is small. Thus: 


v z B( o) r 
' 411 * JJ 


dil. 


|n / | 2 ']/(w , ) 


H A 


, V 2 S( 0) * 


(B4) 


n, as fij 


H a V 2 5( 0) * 
T~ P'2 ■ 


H 


(B5) 


For a disk, [if = / 2tt , and both a Gaussian beam and an Airy pattern have V 2 R(0) = -4 tt/Hj. (For the Airy pattern, this is easiest 

to see by expanding the Bessel function in a power series and differentiating.) Thus, we have the result that Q A « il A +fl*/2. 
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